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3 <110> APPLICANT: The Scripps Research Institute 

4 Deiters, Alexander 

5 Cropp, T Ashton 

6 Chin, Jason W 

7 Anderson, J Christopher 

8 Schultz, Peter G 

10 <120> TITLE OF INVENTION: UNNATURAL REACTIVE AMINO ACID GENETIC CODE ADDITIONS 

12 <130> FILE REFERENCE: 54 - 000250US/PC 
C--> 14 <140> CURRENT APPLICATION NUMBER: US/10/561,121 
C--> 14 <141> CURRENT FILING DATE: 2005-12-13 

14 <160> NUMBER OF SEQ ID NOS : 104 

16 <170> SOFTWARE: Patentln version 3.3 

18 <210> SEQ ID NO: 1 

19 <211> LENGTH: 1275 

20 <212> TYPE: DNA 

21 <213> ORGANISM: Escherichia coli 

23 <400> SEQUENCE: 1 

24 atggcaagca gtaacttgat taaacaattg caagagcggg ggctggtagc ccaggtgacg 60 
26 gacgaggaag cgttagcaga gcgactggcg caaggcccga tcgcgctcta ttgcggcttc 120 
28 gatcctaccg ctgacagctt gcatttgggg catcttgttc cattgttatg cctgaaacgc 180 
30 ttccagcagg cgggccacaa gccggttgcg ctggtaggcg gcgcgacggg tctgattggc 240 
32 gacccgagct tcaaagctgc cgagcgtaag ctgaacaccg aagaaactgt tcaggagtgg 300 
34 gtggacaaaa tccgtaagca ggttgccccg ttcctcgatt tcgactgtgg agaaaactct 360 
36 gctatcgcgg cgaacaacta tgactggttc ggcaatatga atgtgctgac cttcctgcgc 42 0 
38 gatattggca aacacttctc cgttaaccag atgatcaaca aagaagcggt taagcagcgt 480 
40 ctcaaccgtg aagatcaggg gatttcgttc actgagtttt cctacaacct gttgcagggt 540 
42 tatgacttcg cctgtctgaa caaacagtac ggtgtggtgc tgcaaattgg tggttctgac 600 
44 cagtggggta acatcacttc tggtatcgac ctgacccgtc gtctgcatca gaatcaggtg 660 
46 tttggcctga ccgttccgct gatcactaaa gcagatggca ccaaatttgg taaaactgaa 720 
48 ggcggcgcag tctggttgga tccgaagaaa accagcccgt acaaattcta ccagttctgg 780 
50 atcaacactg cggatgccga cgtttaccgc ttcctgaagt tcttcacctt tatgagcatt 840 
52 gaagagatca acgccctgga agaagaagat aaaaacagcg gtaaagcacc gcgcgcccag 900 
54 tatgtactgg cggagcaggt gactcgtctg gttcacggtg aagaaggttt acaggcggca 96 0 
56 aaacgtatta ccgaatgcct gttcagcggt tctttgagtg cgctgagtga agcggacttc 1020 
58 gaacagctgg cgcaggacgg cgtaccgatg gttgagatgg aaaagggcgc agacctgatg 1080 
60 caggcactgg tcgattctga actgcaacct tcccgtggtc aggcacgtaa aactatcgcc 1140 
62 tccaatgcca tcaccattaa cggtgaaaaa cagtccgatc ctgaatactt ctttaaagaa 1200 
64 gaagatcgtc tgtttggtcg ttttacctta ctgcgtcgcg gtaaaaagaa ttactgtctg 1260 
66 atttgctgga aataa 1275 

69 <210> SEQ ID NO: 2 

70 <211> LENGTH: 424 

71 <212> TYPE: PRT 

72 <213> ORGANISM: Escherichia coli 
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74 <400> SEQUENCE: 2 
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172 Thr lie Asn Gly Glu Lys Gin Ser Asp Pro Glu Tyr Phe, Phe Lys Glu 

173 385 390 395 400 

176 Glu Asp Arg Leu Phe Gly Arg Phe Thr Leu Leu Arg Arg Gly Lys Lys 

177 405 410 415 

180 Asn Tyr Cys Leu lie Cys Trp Lys 

181 420 

184 <210> SEQ ID NO: 3 

185 <211> LENGTH: 1275 

186 <212> TYPE: DNA 

187 <213> ORGANISM: Artificial 

189 <220> FEATURE: 

190 <223> OTHER INFORMATION: artificial synthetase 

192 <400> SEQUENCE: 3 

193 atggcaagca gtaacttgat taaacaattg caagagcggg ggctggtagc ccaggtgacg 60 
195 gacgaggaag cgttagcaga gcgactggcg caaggcccga tcgcactcgt gtgtggcttc 120 
197 gatcctaccg ctgacagctt gcatttgggg catcttgttc cattgttatg cctgaaacgc 180 
199 ttccagcagg cgggccacaa gccggttgcg ctggtaggcg gcgcgacggg tctgattggc 240 
201 gacccgagct tcaaagctgc cgagcgtaag ctgaacaccg aagaaactgt tcaggagtgg 300 
203 gtggacaaaa tccgtaagca ggttgccccg ttcctcgatt tcgactgtgg agaaaactct 360 
205 gctatcgcgg ccaataatta tgactggttc ggcaatatga atgtgctgac cttcctgcgc 420 
207 gatattggca aacacttctc cgttaaccag atgatcaaca aagaagcggt taagcagcgt 480 
209 ctcaaccgtg aagatcaggg gatttcgttc actgagtttt cctacaacct gctgcagggt 540 
211 tatagtatgg cctgtttgaa caaacagtac ggtgtggtgc tgcaaattgg tggttctgac 600 
213 cagtggggta acatcacttc tggtatcgac ctgacccgtc gtctgcatca gaatcaggtg 660 
215 tttggcctga ccgttccgct gatcactaaa gcagatggca ccaaatttgg taaaactgaa 720 
217 ggcggcgcag tctggttgga tccgaagaaa accagcccgt acaaattcta ccagttctgg 780 
219 atcaacactg cggatgccga cgtttaccgc ttcctgaagt tcttcacctt tatgagcatt 840 
221 gaagagatca acgccctgga agaagaagat aaaaacagcg gtaaagcacc gcgcgcccag 900 
223 tatgtactgg cggagcaggt gactcgtctg gttcacggtg aagaaggttt acaggcggca 960 
225 aaacgtatta ccgaatgcct gttcagcggt tctttgagtg cgctgagtga agcggacttc 1020 
227 gaacagctgg cgcaggacgg cgtaccgatg gttgagatgg aaaagggcgc agacctgatg 1080 
229 caggcactgg tcgattctga actgcaacct tcccgtggtc aggcacgtaa aactatcgcc 1140 
231 tccaatgcca tcaccattaa cggtgaaaaa cagtccgatc ctgaatactt ctttaaagaa 1200 
233 gaagatcgtc tgtttggtcg ttttacctta ctgcgtcgcg gtaaaaagaa ttactgtctg 1260 
235 atttgctgga aataa 1275 

238 <210> SEQ ID NO: 4 

239 <211> LENGTH: 1275 

240 <212> TYPE: DNA 

241 <213> ORGANISM: artificial 

243 <220> FEATURE: 

244 <223> OTHER INFORMATION: artificial synthetase 

246 <400> SEQUENCE: 4 

247 atggcaagca gtaacttgat taaacaattg caagagcggg ggctggtagc ccaggtgacg 60 
249 gacgaggaag cgttagcaga gcgactggcg caaggcccga tcgcactcac ttgtggcttc 120 
251 gatcctaccg ctgacagctt gcatttgggg catcttgttc cattgttatg cctgaaacgc 180 
253 ttccagcagg cgggccacaa gccggttgcg ctggtaggcg gcgcgacggg tctgattggc 240 
255 gacccgagct tcaaagctgc cgagcgtaag ctgaacaccg aagaaactgt tcaggagtgg 300 
257 gtggacaaaa tccgtaagca ggttgccccg ttcctcgatt tcgactgtgg agaaaactct 360 
259 gctatcgcgg ccaataatta tgactggttc agcaatatga atgtgctgac cttcctgcgc 420 
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261 gatattggca aacacttctc cgttaaccag atgatcaaca aagaagcggt taagcagcgt 480 

263 ctcaaccgtg aagatcaggg gatttcgttc actgagtttt cctacaacct gctgcagggt 540 

265 tatacgtatg cctgtctgaa caaacagtac ggtgtggtgc tgcaaattgg tggttctgac 600 

267 cagtggggta acatcacttc tggtatcgac ctgacccgtc gtctgcatca gaatcaggtg 660 

269 tttggcctga ccgttccgct gatcactaaa gcagatggca ccaaatttgg taaaactgaa 720 

271 ggcggcgcag tctggttgga tccgaagaaa accagcccgt acaaattcta ccagttctgg 780 

273 atcaacactg cggatgccga cgtttaccgc ttcctgaagt tcttcacctt tatgagcatt 840 

275 gaagagatca acgccctgga agaagaagat aaaaacagcg gtaaagcacc gcgcgcccag 900 

277 tatgtactgg cggagcaggt gactcgtctg gttcacggtg aagaaggttt acaggcggca 960 

279 aaacgtatta ccgaatgcct gttcagcggt tctttgagtg cgctgagtga agcggacttc 1020 

281 gaacagctgg cgcaggacgg cgtaccgatg gttgagatgg aaaagggcgc agacctgatg 1080 

283 caggcactgg tcgattctga actgcaacct tcccgtggtc aggcacgtaa aactatcgcc 1140 

285 tccaatgcca tcaccattaa cggtgaaaaa cagtccgatc ctgaatactt ctttaaagaa 1200 

287 gaagatcgtc tgtttggtcg ttttacctta ctgcgtcgcg gtaaaaagaa ttactgtctg 1260 

289 atttgctgga aataa 1275 

292 <210> SEQ ID NO: 5 

293 <211> LENGTH: 1275 

2 94 <212> TYPE:' DNA 

295 <213> ORGANISM: artificial 

297 <220> FEATURE: 

298 <223> OTHER INFORMATION: artificial synthetase 
300 <400> SEQUENCE: 5 

3 01 atggcaagca gtaacttgat taaacaattg caagagcggg ggctggtagc ccaggtgacg 60 
303 gacgaggaag cgttagcaga gcgactggcg caaggcccga tcgcactcgt gtgtggcttc 120 
305 gatcctaccg ctgacagctt gcatttgggg catcttgttc cattgttatg cctgaaacgc 180 
307 ttccagcagg cgggccacaa gccggttgcg ctggtaggcg gcgcgacggg tctgattggc 240 
309 gacccgagct tcaaagctgc cgagcgtaag ctgaacaccg aagaaactgt tcaggagtgg 300 
311 gtggacaaaa tccgtaagca ggttgccccg ttcctcgatt tcgactgtgg agaaaactct 360 
313 gctatcgcgg ccaataatta tgactggttc ggcaatatga atgtgctgac cttcctgcgc 420 
315 gatattggca aacacttctc cgttaaccag atgatcaaca aagaagcggt taagcagcgt 4 80 
317 ctcaaccgtg aagatcaggg gatttcgttc actgagtttt cctacaacct gctgcagggt 540 
319 tatagtatgg cctgtttgaa caaacagtac ggtgtggtgc tgcaaattgg tggttctgac 600 
321 cagtggggta acatcacttc tggtatcgac ctgacccgtc gtctgcatca gaatcaggtg 660 
323 tttggcctga ccgttccgct gatcactaaa gcagatggca ccaaatttgg taaaactgaa 720 
325 ggcggcgcag tctggttgga tccgaagaaa accagcccgt acaaattcta ccagttctgg 780 
327 atcaacactg cggatgccga cgtttaccgc ttcctgaagt tcttcacctt tatgagcatt 840 
329 gaagagatca acgccctgga agaagaagat aaaaacagcg gtaaagcacc gcgcgcccag 900 
331 tatgtactgg cggagcaggt gactcgtctg gttcacggtg aagaaggttt acaggcggca 960 
333 aaacgtatta ccgaatgcct gttcagcggt tctttgagtg cgctgagtga agcggacttc 1020 
335 gaacagctgg cgcaggacgg cgtaccgatg gttgagatgg aaaagggcgc agacctgatg 1080 
337 caggcactgg tcgattctga actgcaacct tcccgtggtc aggcacgtaa aactatcgcc 1140 
33 9 tccaatgcca tcaccattaa cggtgaaaaa cagtccgatc ctgaatactt ctttaaagaa 1200 
341 gaagatcgtc tgtttggtcg ttttacctta ctgcgtcgcg gtaaaaagaa ttactgtctg 1260 
343 atttgctgga aataa 1275 

346 <210> SEQ ID NO: 6 

347 <211> LENGTH: 1275 

348 <212> TYPE: DNA 

349 <213> ORGANISM: artificial 
351 <220> FEATURE: 
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352 <223> OTHER INFORMATION: artificial synthetase 

354 <400> SEQUENCE: 6 

355 atggcaagca gtaacttgat taaacaattg caagagcggg ggctggtagc ccaggtgacg 60 
357 gacgaggaag cgttagcaga gcgactggcg caaggcccga tcgcactcgt gtgtggcttc 12 0 
359 gatcctaccg ctgacagctt gcatttgggg catcttgttc cattgttatg cctgaaacgc 180 
361 ttccagcagg cgggccacaa gccggttgcg ctggtaggcg gcgcgacggg tctgattggc 240 
3 63 gacccgagct tcaaagctgc cgagcgtaag ctgaacaccg aagaaactgt tcaggagtgg 3 00 
365 gtggacaaaa tccgtaagca ggttgccccg ttcctcgatt tcgactgtgg agaaaactct 360 
367 gctatcgcgg ccaataatta tgactggttc ggcaatatga atgtgctgac cttcctgcgc 420 
369 gatattggca aacacttctc cgttaaccag atgatcaaca aagaagcggt taagcagcgt 480 
371 ctcaaccgtg aagatcaggg gatttcgttc actgagtttt cctacaacct gctgcagggt 540 
373 tatagtatgg cctgtttgaa caaacagtac ggtgtggtgc tgcaaattgg tggttctgac 600 
375 cagtggggta acatcacttc tggtatcgac ctgacccgtc gtctgcatca gaatcaggtg 660 
377 tttggcctga ccgttccgct gatcactaaa gcagatggca ccaaatttgg taaaactgaa 720 
379 ggcggcgcag tctggttgga tccgaagaaa accagcccgt acaaattcta ccagttctgg 780 
381 atcaacactg cggatgccga cgtttaccgc ttcctgaagt tcttcacctt tatgagcatt 840 
3 83 gaagagatca acgccctgga agaagaagat aaaaacagcg gtaaagcacc gcgcgcccag 900 
385 tatgtactgg cggagcaggt gactcgtctg gttcacggtg aagaaggttt acaggcggca 960 
387 aaacgtatta ccgaatgcct gttcagcggt tctttgagtg cgctgagtga agcggacttc 1020 
389 gaacagctgg cgcaggacgg cgtaccgatg gttgagatgg aaaagggcgc agacctgatg 1080 
391 caggcactgg tcgattctga actgcaacct tcccgtggtc aggcacgtaa aactatcgcc 1140 
3 93 tccaatgcca tcaccattaa cggtgaaaaa cagtccgatc ctgaatactt ctttaaagaa 1200 
395 gaagatcgtc tgtttggtcg ttttacctta ctgcgtcgcg gtaaaaagaa ttactgtctg 1260 
397 atttgctgga aataa 1275 

400 <210> SEQ ID NO: 7 

401 <211> LENGTH: 1275 

402 <212> TYPE: DNA 

403 <213> ORGANISM: artificial 

405 <220> FEATURE: 

406 <223> OTHER INFORMATION: artificial synthetase 
408 <400> SEQUENCE: 7 

40 9 atggcaagca gtaacttgat taaacaattg caagagcggg ggctggtagc ccaggtgacg 60 

411 gacgaggaag cgttagcaga gcgactggcg caaggcccga tcgcactcac gtgtggcttc 120 

413 gatcctaccg ctgacagctt gcatttgggg catcttgttc cattgttatg cctgaaacgc 180 

415 ttccagcagg cgggccacaa gccggttgcg ctggtaggcg gcgcgacggg tctgattggc 240 

417 gacccgagct tcaaagctgc cgagcgtaag ctgaacaccg aagaaactgt tcaggagtgg 300 

419 gtggacaaaa tccgtaagca ggttgccccg ttcctcgatt tcgactgtgg agaaaactct 360 

421 gctatcgcgg ccaataatta tgactggttc ggcaatatga atgtgctgac cttcctgcgc 420 

423 gatattggca aacacttctc cgttaaccag atgatcaaca aagaagcggt taagcagcgt 480 

425 ctcaaccgtg aagatcaggg gatttcgttc actgagtttt cctacagcct gctgcagggt 540 

427 tatacgatgg cctgtctgaa caaacagtac ggtgtggtgc tgcaaattgg tggttctgac 600 

429 cagtggggta acatcacttc tggtatcgac ctgacccgtc gtctgcatca gaatcaggtg 660 

431 tttggcctga ccgttccgct gatcactaaa gcagatggca ccaaatttgg taaaactgaa 720 

433 ggcggcgcag tctggttgga tccgaagaaa accagcccgt acaaattcta ccagttctgg 780 

435 atcaacactg cggatgccga cgtttaccgc ttcctgaagt tcttcacctt tatgagcatt 840 

43 7 gaagagatca acgccctgga agaagaagat aaaaacagcg gtaaagcacc gcgcgcccag 900 

43 9 tatgtactgg cggagcaggt gactcgtctg gttcacggtg aagaaggttt acaggcggca 960 

441 aaacgtatta ccgaatgcct gttcagcggt tctttgagtg cgctgagtga agcggacttc 1020 

443 gaacagctgg cgcaggacgg cgtaccgatg gttgagatgg aaaagggcgc agacctgatg 1080 
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Please Note: 

Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> 
to <223> fields of each sequence which presents at least one n or Xaa. 

Seq#:20; N Pos . 26,612,618 

Seq#:22; N Pos . 403,513,515,518,531 

Seq#:23; N Pos. 588 

Seq#:26; N Pos. 13,599 

Seq#:27; N Pos . 600 

Seq#:34; N Pos . 13 

Seq#:87; Xaa Pos. 2 

Seq#:88; N Pos. 8 

Seq#: 91; N Pos . 1, 14 

Invalid <213> Response: 

Use of "Artificial" only as "<213> Organism" response is incomplete, 

per 1.823(b) of New Sequence Rules. Valid response is Artificial Sequence. 

Seq#: 3, 4 ,5, 6 ,7, 8, 9 ,10, 11 ,12 ,13, 14 ,15, 16 ,17 ,18 ,19, 2 0 ,21 ,22, 2 3, 24 ,25 ,26 ,27, 2 8, 2 9 
Seq#: 3 0,3 1,32, 33, 34, 35, 36, 3 7, 38, 3 9, 40, 41, 42, 43, 44, 45, 46 ,47, 48, 49, 50, 51, 52,53 
Seq#: 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79 
Seq#: 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94; 95, 96, 97, 98, 99, 100, 101, 102 
Seq#:103,104 
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VERIFICATION SUMMARY 

PATENT APPLICATION: US/10/561,121 



DATE: 01/03/2006 
TIME: 11:36:12 



L:14 M:270 C: 
L:14 M:271 C: 
L:825 M:341 W 
L:845 M:341 W 
L:925 M:341 W 
L:929 M:341 W 
L:967 M:341 W 
L:1050 M:341 W 
L:1068 M:341 W 
L:1106 M:341 W 
L:1318 M:341 W 
L:5080 M:341 W 
L:5099 M:341 W 
L:5150 M:341 W 



Input Set : F:\54-000250US.ST25.txt 
Output Set: N:\CRF4\01032006\J561121.raw 

Current Application Number differs, Replaced Current Application No 
Current Filing Date differs, Replaced Current Filing Date 
* " used, for SEQ ID# : 20 after pos . : 0 

used, for SEQ ID#:20 after pos.: 600 
used, for SEQ ID#:22 after pos.: 360 
used, for SEQ ID#:22 after pos.:480 
used, for SEQ ID#:23 after pos.: 540 
used, for SEQ ID#:26 after pos . : 0 
used, for SEQ ID#:26 after pos.: 540 
used, for SEQ ID#:27 after pos. 
used, for SEQ ID# : 
used, for SEQ ID# : 
used, for SEQ ID# : 
used, for SEQ ID# ; 



46) 
(46) 
(46) 
(46) 
(46) 
(46) 
(46) 
(46) 
(46) 
(46) 
(46) 
(46) 



'n' 

■n" 

•n" 

»n" 

•n" 

»n" 
it n i. 

"n" 
"n" 

"n" 
"n" 



or 
or 
or 
or 
or 



or 
or 
or 
or 
or 
or 
or 



'Xaa» 
"Xaa" 
"Xaa" 
"Xaa" 
"Xaa" 
"Xaa' 
"Xaa" 
"Xaa" 
"Xaa" 
"Xaa" 
"Xaa" 
"Xaa" 



:34 after pos. 
:87 after pos. 
:88 after pos. 
: 91 after pos . 



:540 
:0 
: 0 
:0 
: 0 
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